MUC-1 derived peptides



The present invention relates to antigenic polypeptides of the MUC-1 protein which
are able to activate a Cytotoxic T Lymphocyte (CTL) response and to nucleotide sequences
encoding such polypeptides. Furthermore, the present invention relates to vectors
comprising such nucleotide sequences, host cells comprising the same and their use in the
production of the antigenic polypeptides. In addition, the present invention relates to
compositions comprising the polypeptides, nucleotide sequences, vectors or host cells of the
present invention and to therapeutic and diagnostic uses of such compositions.

Generally speaking, there are two major types of immune response: the humoral
response, which is characterized by the production of antibodies by B lymphocytes, and the
cell-mediated immune response. Antibodies are able to recognize antigens in their three-dimensional
form, either soluble or bound to an insoluble support such as a cell, while T cells recognize
processed antigen fragments which are bound and presented by glycoproteins encoded by the
major histocompatibility complex (MHC), notably MHC-I genes, which are expressed at the cell
surface of almost all vertebrate cells, or MHC-II genes, which are expressed on antigen
presenting cells (APC).

A cell-mediated immune response usually necessitates the cooperation of helper T
lymphocytes and effector cells. This cooperation takes place, in particular, as a result of
interleukin-2 and/or various other cytokines which are secreted by helper T lymphocytes after
their activation by antigenic fragments presented by APC in association with MHC-II.
Cytotoxic T Lymphocytes (CTL) are activated, induced to proliferate and to exert their
antigen-specific cytotoxic function upon exposure to antigenic polypeptides complexed with
autologous MHC-I, co-stimulatory molecules on the surface of the APC and cytokines, often
derived from helper T cells. T cell derived cytokines can also trigger and drive the proliferation
and antigen processing capacity of APC as well as activation and induction of proliferation in
other cells, including other T cells.

Said antigen presentation by MHC-I molecules has been characterized (see for
example Groettrup et al, 1996, Immunology Today, 17, 429-435): a full-sized protein or
glycoprotein antigen is digested into shorter antigenic polypeptides (of about 7 to 13 amino
acids in length). Said polypeptides are associated with MHC-I molecules and β-2 microglobulin,
leading to a ternary complex which is further presented on the cell surface.

MHC-I specificity towards antigens can vary greatly depending on the
MHC-I molecule considered (HLA-A, HLA-B, ...) and on the allele (HLA-A2, HLA-A3,
HLA-A11, ...), since the genes encoding the MHC molecules are highly variable between
individuals within a species (reviewed in George et al, 1995, Immunology Today, 16, 209-212).

Most tumor cells express antigens at their surface which differ either
qualitatively or quantitatively from the antigens present at the surface of the corresponding
normal cells. These antigens are specific when they are expressed only by tumor cells. When
they are present on both normal and tumor cells, these antigens are said to be associated with
the tumor; in this case, they are present either in larger amounts or in a different form in the
tumor cells.

It is now well known that patients suffering from a cancer may develop an
immune response to their tumour. This has been revealed, in particular, by demonstrating that
the serum of some patients contains anti-tumor antigen antibodies, and that their serum is
capable of inhibiting the growth of cancer cells in vitro. Nevertheless, inasmuch as spontaneous
tumor regressions are extremely rare, it appears that the immune response observed in vitro
remains ineffective in vivo.

Hellstrom et al (1969, Adv. Cancer Res. 12, 167-223) have shown that antigen-specific
CTL can be effective mediators in a tumour-specific immune response. However, this
natural immune response is not always effective enough to limit tumor growth. Although an
immune response may develop against a tumor, it is not known whether it is of real benefit to
the patient. Seemingly uncontrolled tumour growth would suggest that a tumor eludes the
body's mechanisms of immune surveillance. Tumour-derived molecules are considered to play
a significant part in modifying or diverting the immune response in favour of the tumor rather
than in favour of the individual.

In the light of the complexity of the immune response against tumors and the
modest state of current knowledge in this field, the use of an anticancer vaccine is not obvious.
Animal studies have shown that immunization using living or killed cancer cells could lead to
rejection of a subsequent tumor graft; however, attempts at immunization using acellular
products, for example administration of the complete antigenic protein, of polypeptide
fragments of such a protein, or of DNA fragments encoding all or part of tumor-associated
proteins, have generally been less successful.

Recently, Toes et al. (1997, Proc. Natl. Acad. Sci., 94, 14660-14665) have
developed an alternative approach based on the selection of minimal antigenic polypeptide
fragments which might be specifically recognized by the CTL. According to said method, the minimal
antigenic fragments are expressed in the host cells where they can be associated with MHC-I
molecules and then be presented on the cell surface, inducing a specific immune reaction.
More specifically, it has been shown that intra-cellular expression of "minigenes" encoding very
short epitopes (from 7 to 13 amino acids in length) can induce a cellular immune response.
Moreover, Whitton et al. (1993, J. of Virology, 67, 348-352) have proposed the use of a
vector, called "string of beads" construct, which co-expresses several minigenes and can induce
a synergistic CTL immune response.

Another recent and important use for such polypeptides is in association with soluble
complexes of MHC-I, β-2 microglobulin and a fluorescent or otherwise visually detectable
reagent. These so-called "Tetramers" (e.g., as described in Altman et al, 1996, Science, 274:94-96)
can be used to identify antigen-specific CTL ex vivo by flow cytometry or histology.

MUC-1 is a glycosylated mucin polypeptide found on the apical surface of
mucin-secreting epithelial cells in various tissues, including breast, lung, pancreas, stomach,
ovaries, fallopian tubes, and intestine (Peat et al, 1992, Cancer Res.; 52:1954-60; Ho et al,
1993, Cancer Res.; 53:641-51). Malignant transformation of breast, ovary, pancreas and
probably other epithelial tissues results in overexpression of MUC-1 polypeptide in tumor
cells (Hareuveni et al, 1990, Eur. J. Biochem.; 189:475-86; Layton et al, 1990, Tumor
Biol.; 11:274-86). In addition, abnormal glycosylation of MUC-1 polypeptide in breast, and
probably other MUC-1-expressing tumour cells, results in the exposure of tumor-associated
antigenic epitopes on the protein core of MUC-1 (Burchell et al, 1987, Cancer Res.; 47:5476-82;
Devine et al, 1990, J. Tumor Marker Oncol.; 5:11-26; Xing et al, 1989, Immunol. Cell
Biol.; 67:183-95) as well as on the glycosyl side chains (Samuel et al, 1990, Cancer Res.;
50:4801-8).



Monoclonal antibodies specific for these epitopes have been described which
can identify more than 90% of breast and pancreatic tumors. Non-major-histocompatibility-complex
(MHC)-restricted cytotoxic T cell responses to the MUC-1 tumor-specific protein
epitope by T cells from breast and pancreatic cancer patients have also been reported (Jerome
et al, 1991, Cancer Res.; 51:2908-16) in addition to MHC-restricted, MUC-1-specific CTL
(Reddish et al, 1995, Int J Cancer, 10:817-823). Moreover, proliferation of T cells to purified
MUC-1 has been seen (Keydar et al, 1989, Proc. Natl. Acad. Sci. USA; 86:1362-6). These
various observations suggest that MUC-1 may be an effective target antigen for active
immunotherapy in breast, as well as other, cancers. Hareuveni et al (1991, Vaccine; 9:618-27)
expressed the MUC-1 antigen in vaccinia virus and showed that rats immunized with VV-MUC-1
rejected MUC-1-bearing tumor cells at a rate of 60-80 % (Hareuveni et al, 1990, Proc. Natl.
Acad. Sci. USA; 87:9498-502).

The present invention concerns immuno-reactive polypeptides identified from
the MUC-1 polypeptide sequence and their uses in cancer therapy and diagnosis. The invention
could also be used to follow MUC-1 specific immune responses in patients during the course
of disease and/or treatment. The invention also concerns nucleotide sequences encoding these
polypeptides, vectors useful for transferring and expressing said nucleotide sequences into
target cells, and uses of said nucleotide sequences in cancer gene therapy and diagnosis.

In accordance with the present invention polypeptides have been identified
wherein said polypeptide comprises a polypeptide sequence of at most 20 consecutive amino
acids defined in SEQ ID NO: 1, wherein said polypeptide is different from SEQ ID NO: 2 and
is capable of binding with at least one MHC-I molecule.

"Capable of binding with" means that the polypeptide considered is able to
interact and to bind with MHC-I molecules. In a preferred embodiment of the invention, this
binding results in cell surface presentation of these polypeptides by MHC class I molecules in
order to elicit a specific immune response or for the detection of a specific immune response,
e.g., by Tetramer analysis (e.g., as described in Altman et al, 1996, Science, 274:94-96).

According to a preferred embodiment, said polypeptide sequence is selected
from the group consisting of SEQ ID NO: 3 to SEQ ID NO: 33. Data explaining why these
sequences have been selected are shown in Figures 1 to 7.

According to a first embodiment of the invention, said polypeptide presents at 
least one of the following properties : 

(a) the polypeptide sequence is selected from the group consisting of SEQ
ID NO: 3 to SEQ ID NO: 6, and said polypeptide binds the HLA A2 glycoprotein of MHC-I,

(b) the polypeptide sequence is selected from the group consisting of SEQ
ID NO: 7 to SEQ ID NO: 15, and said polypeptide binds the HLA B7 glycoprotein of MHC-I,

(c) the polypeptide sequence is selected from the group consisting of SEQ
ID NO: 16 to SEQ ID NO: 19, and said polypeptide binds the HLA A3 glycoprotein of MHC-I,

(d) the polypeptide sequence is selected from the group consisting of SEQ
ID NO: 19 to SEQ ID NO: 21, and said polypeptide binds the HLA A11 glycoprotein of
MHC-I,

(e) the polypeptide sequence is selected from the group consisting of SEQ
ID NO: 22 to SEQ ID NO: 25, and said polypeptide binds the HLA A24 glycoprotein of
MHC-I,

(f) the polypeptide sequence is selected from the group consisting of SEQ
ID NO: 26 to SEQ ID NO: 29, and said polypeptide binds the HLA A1 glycoprotein of MHC-I,

(g) the polypeptide sequence is selected from the group consisting of SEQ
ID NO: 30 to SEQ ID NO: 33, and said polypeptide binds the HLA B8 glycoprotein of MHC-I.
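
By way of illustration only, the grouping in items (a) to (g) can be represented as a simple lookup table. The following Python sketch is not part of the described method; the names `HLA_GROUPS`, `alleles_for_seq_id` and `candidate_seq_ids` are hypothetical, and the SEQ ID ranges are copied as stated in the list (SEQ ID NO: 19 therefore appears under both HLA A3 and HLA A11).

```python
# Illustrative lookup of SEQ ID NO ranges per HLA class I allele, as listed
# in items (a) to (g). All names here are hypothetical helpers.
HLA_GROUPS = {
    "HLA-A2": range(3, 7),     # (a) SEQ ID NO: 3 to 6
    "HLA-B7": range(7, 16),    # (b) SEQ ID NO: 7 to 15
    "HLA-A3": range(16, 20),   # (c) SEQ ID NO: 16 to 19
    "HLA-A11": range(19, 22),  # (d) SEQ ID NO: 19 to 21, as stated in the text
    "HLA-A24": range(22, 26),  # (e) SEQ ID NO: 22 to 25
    "HLA-A1": range(26, 30),   # (f) SEQ ID NO: 26 to 29
    "HLA-B8": range(30, 34),   # (g) SEQ ID NO: 30 to 33
}


def alleles_for_seq_id(seq_id):
    """Return the alleles whose group contains the given SEQ ID NO."""
    return [allele for allele, ids in HLA_GROUPS.items() if seq_id in ids]


def candidate_seq_ids(patient_alleles):
    """Return the sorted SEQ ID NOs listed for any of the given alleles."""
    ids = set()
    for allele in patient_alleles:
        ids.update(HLA_GROUPS.get(allele, ()))
    return sorted(ids)


if __name__ == "__main__":
    print(alleles_for_seq_id(19))
    print(candidate_seq_ids(["HLA-A2", "HLA-B8"]))
```

Such a table could, for example, be used to pick the polypeptides relevant to a subject of known HLA type.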

The invention also relates to a polynucleotide comprising a nucleic acid
sequence encoding at least one polypeptide of the present invention. According to a preferred
embodiment, the nucleic acid sequence of the invention is selected from the group consisting
of SEQ ID NO: 34 to SEQ ID NO: 64, and their complementary sequences.

The term "polynucleotide" as used in the scope of the present invention means
a DNA and/or RNA fragment, single or double-stranded, linear or circular, natural or synthetic,
modified or not (see US-A-5,525,711, US-A-4,711,955, US-A-5,792,608 or EP-A-0 302 175
for modification examples) defining a fragment or a portion of a nucleic acid, without size
limitation. It may be, inter alia, a genomic DNA, a cDNA or an mRNA. "Polynucleotides" and
"nucleic acids" are synonyms with regard to the present invention. The nucleic acid may be in
the form of a linear polynucleotide, and preferably in the form of a plasmid. A wide range of
plasmids is commercially available and well known by one skilled in the art. These available
plasmids are easily modified by molecular biology techniques (Sambrook et al, 1989,
Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York).
Plasmids derived from pBR322 (Gibco BRL), pUC (Gibco BRL), pBluescript (Stratagene),
pREP4, pCEP4 (Invitrogen) and also pPoly (Lathe et al., 1987, Gene 57, 193-201) are
illustrative of these modifications. According to the invention, the nucleic acid can be a naked
polynucleotide (Wolff et al., Science 247 (1990), 1465-1468) or can be formulated with at least one
compound such as polypeptides, preferably viral polypeptides, oligonucleotides, cationic
lipids or cationic polymers which can participate in the uptake of the nucleic acid into the cells
(see Ledley, Human Gene Therapy 6 (1995), 1129-1144 for a review) or a protic polar
compound (examples are provided below in the present specification or in EP-A-0 890362).
"Polynucleotide" also designates nucleic acid of viral origin (viral vector) which encodes at
least the polypeptide of the invention. Such a viral vector preferably derives from a virus
selected among poxvirus (vaccinia virus, MVA, canarypox, ...), adenovirus, retrovirus, herpes
virus, alphavirus, foamy virus or adeno-associated virus. Said viral vectors and their uses are
widely disclosed in gene therapy literature.

Preferably, said nucleic acid includes at least one therapeutically useful gene
sequence that can be transcribed and translated to generate a polypeptide of interest and the
elements enabling its expression. The genetic information necessary for expression by a target
cell comprises all the elements required for transcription of DNA into RNA and, if necessary,
for translation of mRNA into a polypeptide. Transcriptional promoters suitable for use in
various vertebrate systems are well known. For example, suitable promoters include viral
promoters like RSV, MPSV, SV40, CMV or the 7.5k vaccinia promoter, inducible promoters,
tissue-specific promoters, synthetic promoters, etc., or combinations thereof. The nucleic acid can
also include intron sequences, targeting sequences, transport sequences, and sequences involved
in replication or integration. Said sequences have been reported in the literature and can be
readily obtained by those skilled in the art. The nucleic acid can also be modified in order to
be stabilized with specific components such as spermine.

The introduction or transfer process of an anionic substance of interest into a
cell is by itself well known. "Introduction or transfer" means that the polynucleotide is
transferred into the cell and is located, at the end of the process, inside said cell or within or
on its membrane. It is also called "transfection" or "infection" depending on the nature of the
vector.

The invention is therefore further directed to a vector, of viral or plasmid
origin, comprising at least a nucleic acid sequence of the invention.

According to a preferred embodiment, the vector of the invention comprises one
or more nucleotide sequences selected from the group consisting of:

the sequences encoding a polypeptide as defined in (a) in addition to one or
more of the sequences encoding a polypeptide as defined in (b), (c), (d),
(e), (f) or (g),

the sequences encoding a polypeptide as defined in (b) in addition to one or
more of the sequences encoding a polypeptide as defined in (a), (c), (d),
(e), (f) or (g),

the sequences encoding a polypeptide as defined in (c) in addition to one or
more of the sequences encoding a polypeptide as defined in (a), (b), (d),
(e), (f) or (g),

the sequences encoding a polypeptide as defined in (d) in addition to one or
more of the sequences encoding a polypeptide as defined in (a), (b), (c),
(e), (f) or (g),

the sequences encoding a polypeptide as defined in (e) in addition to one or
more of the sequences encoding a polypeptide as defined in (a), (b), (c),
(d), (f) or (g),

the sequences encoding a polypeptide as defined in (f) in addition to one or
more of the sequences encoding a polypeptide as defined in (a), (b), (c),
(d), (e) or (g),

the sequences encoding a polypeptide as defined in (g) in addition to one or
more of the sequences encoding a polypeptide as defined in (a), (b), (c),
(d), (e) or (f).

The invention also encompasses pharmaceutical compositions which are
particularly useful for the delivery of polynucleotides of the invention to cells or tissues of a
subject in the scope of a gene therapy method, especially in the case of cancer treatment. The
term "gene therapy method" is preferably understood as a method for the introduction of a
polynucleotide into cells either in vivo or by introduction into cells in vitro followed by
re-implantation into a subject. "Gene therapy" in particular concerns the case where the
polynucleotide is expressed in a target tissue, especially tissue comprising cells expressing
MHC-I molecules.

Preferably, the pharmaceutical composition furthermore comprises a
pharmaceutically acceptable carrier or diluent. The carrier or diluent is non-toxic to recipients
at the dosages and concentrations employed. Representative examples of carriers or diluents for
injectable solutions include water, isotonic saline solutions which are preferably buffered at
physiological pH (such as phosphate buffered saline or Tris-buffered saline), mannitol,
dextrose, glycerol and ethanol, as well as polypeptides or proteins such as human serum
albumin. This carrier or diluent is preferably isotonic, hypotonic or weakly hypertonic and has
a relatively low ionic strength. Furthermore, it may contain any relevant solvents, aqueous or
partly aqueous liquid carriers comprising sterile, pyrogen-free water, dispersion media,
coatings, and equivalents. The pH of the pharmaceutical preparation is suitably adjusted and
buffered.

The invention more particularly pertains to a pharmaceutical composition
comprising at least one of the complexes described above and also incorporating at least one
adjuvant capable of improving the transfection capacity of said complex. Adjuvants may be
selected from the group consisting of chloroquine, protic polar compounds such as propylene
glycol, polyethylene glycol, glycerol, EtOH, 1-methyl L-2-pyrrolidone or their derivatives, or
aprotic polar compounds such as dimethylsulfoxide (DMSO), diethylsulfoxide,
di-n-propylsulfoxide, dimethylsulfone, sulfolane, dimethylformamide, dimethylacetamide,
tetramethylurea, acetonitrile or their derivatives.

In a preferred embodiment, the polynucleotide which is contained in the pharmaceutical
composition is a DNA. Other particular embodiments of the invention are pharmaceutical
compositions wherein said polynucleotide is naked, associated with viral polypeptides or
complexed with cationic components, more preferably with cationic lipids. In general, the
concentration of polynucleotide in the pharmaceutical compositions is from about 0.1 µg/ml
to about 20 mg/ml.

The pharmaceutical composition in accordance with the present invention can 
be administered into a vertebrate tissue. This administration may be made by intradermal, 
subdermal, intravenous, intramuscular, intranasal, intracerebral, intratracheal, intraarterial, 
intraperitoneal, intravesical, intrapleural, intracoronary or intratumoral injection, by means of 
a syringe or other devices. Transdermal administration is also contemplated, as are inhalation, 
aerosol routes, instillation or topical application. 



According to the present invention, the pharmaceutical composition can be
administered into target tissues of the vertebrate body including those of muscle, skin, brain,
lung, liver, spleen, bone marrow, thymus, heart, lymph, bone, cartilage, pancreas, kidney, gall
bladder, stomach, intestine, testis, ovary, uterus, rectum, nervous system, eye, gland,
connective tissue, blood, etc. In a preferred embodiment, said composition will be administered
into a tumor.

Administration of such a composition to a patient makes it possible to elicit an immune
response based on the activation of cytotoxic lymphocytes by the polypeptides encoded by said
nucleotide sequences. The composition of the invention is particularly suitable for the treatment
of MUC-1-expressing cancers, such as breast cancer, ovarian cancer, pancreatic cancer or lung cancer.

According to the invention, "cells" include prokaryote cells and eukaryote cells,
yeast cells, plant cells, human or animal cells, in particular mammalian cells. In particular,
cancer cells should be mentioned. The invention can be applied in vivo to the interstitial or
luminal space of tissues in the lungs, the trachea, the skin, the muscles, the brain, the liver, the
heart, the spleen, the bone marrow, the thymus, the bladder, the lymphatic system, the blood,
the pancreas, the stomach, the kidneys, the ovaries, the testicles, the rectum, the peripheral or
central nervous system, the eyes, the lymphoid organs, the cartilage, the endothelium. In
preferred embodiments, the cell will be a muscle cell, a haematopoietic system stem cell or an
airways cell, a tracheal or pulmonary cell, or a tumor cell.

The present invention also encompasses a process for transferring a nucleic acid
into cells wherein said process comprises contacting said cells with at least one polynucleotide
according to the invention. This process may be applied by direct administration of said
polynucleotide to cells of the animal in vivo, or by in vitro treatment of cells which were
recovered from the animal and then reintroduced into the animal body (ex vivo process). In
the in vitro application, cells cultivated on an appropriate medium are placed in contact with a
suspension consisting of the polynucleotide of the invention. After an incubation time, the cells are
washed and recovered. Introduction of the polynucleotide can be verified (if necessary after lysis
of the cells) by any appropriate method.

In the case of in vivo treatment according to the invention, in order to improve
the transfection rate, the patient may undergo a macrophage depletion treatment prior to
administration of the pharmaceutical preparations described above. Such a technique is
described in the literature (refer particularly to Van Rooijen et al., 1997, TibTech, 15, 178-184).

The invention further concerns the use of a polypeptide or a polynucleotide,
a vector or a host cell as defined above for the preparation of a pharmaceutical composition
intended for diagnostic, curative, preventive or vaccination treatment of man or animals, and
more specifically for the treatment of cancer.

The invention also relates to a diagnostic composition comprising at least
one polypeptide as defined above. The use of a polypeptide of the invention in a diagnostic
composition is illustrated by the following processes:

a process which enables the detection and, where appropriate, the quantification of an antibody
directed against said polypeptide and which consists in (i) contacting with said polypeptide
a biological sample likely to contain said antibody and (ii) detecting the
formation of an immune complex between said antibody and said polypeptide.

a process which enables the detection and, where appropriate, the quantification of MUC-1-specific
T lymphocytes according to the ELISPOT technique (Scheibenbogen
et al, 1997, Clinical Cancer Research, 3, 221-226), Tetramer analysis (e.g., as
described in Altman et al, 1996, Science, 274:94-96) or other techniques which
allow the identification of specific T cells by virtue of the specificity of their T cell
receptor for the polypeptides of this invention.

The methods, compositions and uses of the invention can be used for the treatment
of all kinds of cancer whose treatment and/or diagnosis is related to or dependent
on the immune properties of the polypeptides of the invention. The compositions and uses of
the present invention may be desirably employed in humans, although animal treatment is also
encompassed by the uses described herein.

These and other embodiments are disclosed or are obvious from and
encompassed by the description and examples of the present invention. Further literature
concerning any one of the methods, uses and compounds to be employed in accordance with
the present invention may be retrieved from public libraries, using for example electronic
devices. For example the public database "Medline" may be utilized which is available on the
Internet, e.g. under http://www.ncbi.nlm.nih.gov/PubMed/medline.html. Further databases and
addresses, such as http://www.ncbi.nlm.nih.gov/,
http://www.infobiogen.fr/, http://www.fmi.ch/biology/research_tools.html,
http://www.tigr.org/, are known to the person skilled in the art and can also be obtained using,
e.g., http://www.lycos.com. An overview of patent information in biotechnology and a survey
of relevant sources of patent information useful for retrospective searching and for current
awareness is given in Berks, TIBTECH 12 (1994), 352-364.

The invention has been described in an illustrative manner, and it is to be understood that the 
terminology which has been used is intended to be in the nature of words of description rather 
than of limitation. Obviously, many modifications and variations of the present invention are 
possible in light of the above teachings. It is therefore to be understood that within the scope 
of the appended claims, the invention may be practised otherwise than as specifically described. 
Accordingly, those skilled in the art will recognize, or be able to ascertain using no more than
routine experimentation, many equivalents to the specific embodiments of the invention
described specifically herein. Such equivalents are intended to be encompassed in the scope of 
the following claims. 






The figures show: 



Figures 1 to 7 show competition binding data from polypeptides whose sequences are from
within the sequence of human MUC-1. Experiments were performed according to the method
described in van der Burg et al (Hum. Immunol. 1995, 44: 189-198). Polypeptides described
as "pp x" correspond to the SEQ ID NO. For example, pp 27 corresponds to SEQ ID NO: 27.
In some figures the competition binding curves of some negative (therefore not claimed)
polypeptide sequences are shown to demonstrate the specificity of the competition binding
assay.

Figures 8, 9 and 10 show ELISpot data from three experiments performed with PBMC from patients
immunized with VV-MUC-1-IL2, exposed to polypeptides from this invention. Spots per 10e6 PBMC
indicate the number of CD8+ (CTL) T lymphocytes, per million PBMC, which are specific for that
polypeptide. Black histograms represent the ELISpot responses of PBMC drawn from the patient 1
weeks after the injection of VV-MUC-1-IL2 (Figure 8), 4 weeks after injection (Figure 9) or 4 weeks
after the second injection (Figure 10). The white bars correspond to the ELISpot response of patient
PBMC taken before VV-MUC-1 administration (Figure 8), 5 months after injection (Figure 9) or before
the second injection (Figure 10).
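
As an illustration of how the "spots per million PBMC" scale of Figures 8 to 10 can be obtained from raw plate counts, the following Python sketch normalizes the mean spot count of replicate wells to one million cells. The function name, the number of replicate wells and the number of cells per well are hypothetical and are not taken from the experiments described here.

```python
# Minimal sketch: convert raw ELISpot well counts into spots per million PBMC.
# The well counts and cells-per-well value below are invented for illustration.

def spots_per_million(spot_counts, cells_per_well):
    """Mean spot count of replicate wells, scaled to 1,000,000 plated cells."""
    if not spot_counts:
        raise ValueError("at least one well count is required")
    if cells_per_well <= 0:
        raise ValueError("cells_per_well must be positive")
    mean_spots = sum(spot_counts) / len(spot_counts)
    return mean_spots * 1_000_000 / cells_per_well


if __name__ == "__main__":
    # Three hypothetical replicate wells, each seeded with 200,000 PBMC.
    print(spots_per_million([12, 8, 10], 200_000))
```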

The examples illustrate the invention. 
EXAMPLES 

The following examples show: 



1) The identification of MUC-1 polypeptides which bind specifically to molecules of the human
Major Histocompatibility Complex I (MHC-I); and

2) The utility of these polypeptides in a functional bioassay, known as the ELISpot (Enzyme-Linked
Immuno Spot) assay.



1) Competition Binding Assay



INTRODUCTION: Polypeptide fragments, 8 - 13 amino acids long, of proteins produced within a
nucleated, vertebrate cell are associated with newly formed cellular proteins of the MHC-I complex. The
complex of MHC-I protein and polypeptide fragment is further associated with a protein known as
Beta-2 microglobulin. This trimolecular complex is then transported to the cell surface, anchored to the
cell membrane and exposed to the extracellular milieu. Thymus-derived, or T, lymphocytes of the CD8
category have specific 'antigen' receptors on their cell surface, which recognise the
MHC-I-Beta-2-microglobulin-peptide complex. Individual CD8+ T cells or clonal progeny of an individual T cell
precursor express on their cell surface antigen receptors which recognise only one (or very few) such
polypeptides within the context of the MHC-I-Beta-2-microglobulin-peptide complex. This is known
as 'Antigen Specificity' of T lymphocytes. If the polypeptide is derived from a normal or 'self' protein,
T lymphocytes are not 'stimulated' due either to the deletion of self-specific T cells from the immune
repertoire or to negative regulation of self-specific immune responses. If, however, the polypeptide
is from a pathogenic organism, such as a virus, then specific T cells are activated to proliferate and to
become cytotoxic such that CD8 'cytotoxic effector cells' or CTL specifically recognise the infected cell
and eliminate that cell in an effort to contain the pathogenic condition. Tumours can also produce
'tumour-specific' protein molecules or modifications of cellular proteins. In those cases, specific CD8
T cells can recognise a tumour cell as pathogenic and eliminate these cells by the same mechanism as
is used to eliminate virus-infected cells. Often, the tumour-specific modification of a protein is merely
quantitative in that a protein is over-produced in tumour cells. It has been shown that such proteins can
also be recognised by specific Cytotoxic T Lymphocytes (CTL). For example see Disis et al, Cancer
Research, 54: 1071-1076 (1994); or Barnd et al, Proc Natl Acad Sci USA, 86: 7159-7163 (1989).



It has been shown in numerous publications that the binding of said polypeptides to MHC-I 
molecules depends upon certain 'motifs' of amino acids at defined positions within the polypeptides. For 
example, the amino acids Leucine at position 2 and Valine at position 9 of a nine amino acid polypeptide 
will result in the binding of that polypeptide to HLA-A2. For a review, see Rammensee et al., 
Immunogenetics, 41: 178-228 (1995). Knowledge of the required amino acid positions has been 
acquired by extracting polypeptides from MHC-I molecules and sequencing them. In addition to 
the 'anchoring' residues there are various other 'preferred' flanking amino acids, such that a polypeptide 
can be given a 'rank' of likelihood that it will bind to a particular MHC-I molecule, depending on its 
sequence. Such ranking of predicted binding of polypeptides can be determined by one of several 
computer programs. In accordance with the invention, the program 'BIMAS' has been consulted 
((BioInformatics & Molecular Analysis Section) 'HLA POLYPEPTIDE Binding Predictions', 
http://bimas.dcrt.nih.gov/molbio/hla_bind/) for predictions of which polypeptides from the human 
MUC-1 sequence are likely to bind to various HLA types. This, of course, is only a computer prediction, 
and binding must be ascertained with a biochemical assay. The polypeptides selected by 
the binding assay must then be tested in a biological assay. 
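
The anchor-residue rule described above can be illustrated with a minimal Python sketch. It is not the BIMAS algorithm, which ranks candidates using further preferred residues; it only scans a sequence for nine-residue windows with Leucine at position 2 and Valine at position 9. The function name is hypothetical, and the fragment used is residues 184 to 200 of SEQ ID NO: 2 written in one-letter code.

```python
def find_a2_anchor_nonamers(protein):
    """Return (offset, peptide) for every 9-mer with Leu at position 2 and Val at position 9.

    Offsets are 0-based positions within the supplied string.
    """
    hits = []
    for start in range(len(protein) - 8):
        peptide = protein[start:start + 9]
        # Positions 2 and 9 of the peptide are indices 1 and 8.
        if peptide[1] == "L" and peptide[8] == "V":
            hits.append((start, peptide))
    return hits


# Residues 184-200 of SEQ ID NO: 2 (Asn Arg Pro Ala Leu Gly Ser Thr Ala Pro Pro Val His Asn Val Thr Ser)
fragment = "NRPALGSTAPPVHNVTS"
print(find_a2_anchor_nonamers(fragment))  # [(3, 'ALGSTAPPV')]
```

The single hit, ALGSTAPPV, corresponds to SEQ ID NO: 3. A motif match of this kind is only a first filter; as stated above, binding still has to be confirmed experimentally.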

About 200 of the top ranking MUC-1 polypeptides, predicted by the BIMAS program 
to bind to HLA-A1, A2, A3, A11, A24, B7 and B8, were produced (NeoSystem, Strasbourg, 
France) and were screened for HLA binding by a competitive binding assay. This assay is described in 
van der Burg et al., Human Immunology, 44: 189-198 (1995). Briefly, EBV-transformed B lymphocyte 
cell lines of a known HLA type are exposed to a polypeptide known to bind to that HLA type. Binding 
of the polypeptide is determined by flow cytometry using a Fluorescence Activated Cell Sorter. Binding 
can be visualised with this apparatus since the polypeptide known to bind is tagged with a fluorescein 
molecule. Thus cells binding the polypeptide become fluorescent. Each polypeptide to be screened for 
binding is mixed together with the reference fluorescent polypeptide, which is at a constant 
concentration of 150 nM. The test polypeptide is added at various concentrations and the mixture 
exposed to the same cells. A test polypeptide is considered 'positive' if the binding of the reference 
polypeptide is inhibited by 50 % at 20 µg/ml or less of the test polypeptide. Figures 1 - 7 show 
data for the competitive binding of polypeptides deemed, by this assay, to be positive for binding to 
HLA-A2, B7, A3, A11, A24, A1 and B8 respectively. In each case, binding is compared to a negative 
control polypeptide, known not to bind to that HLA type, and a positive control polypeptide, known to 
bind to that HLA type. For example, the positive control 'Flu' is a polypeptide with the sequence 
GILGFVFTL from the influenza virus matrix protein (Scheibenbogen et al., International Journal of 
Cancer, 71: 932-936 (1997)). 

MATERIALS: 

EBV-B cell lines were derived by cultivating human Peripheral Blood Mononuclear Cells (PBMC) in 
filtered culture supernatant which had been used to grow cells from the marmoset line B95-8. These cells 
produce the Epstein-Barr Virus. PBMC were cultured for 2-3 days in the presence of 1 µg/ml Cyclosporin 
A (to inhibit T cell reactivity to the virus) in the B95-8 supernatant, and thereafter cultured in fresh 
culture medium. HLA-typed EBV-transformed human B cell lines were used for all tests. HLA types are 
described in Table I. 

Culture medium was Dulbecco's Modified Eagle's Medium (DMEM) + 5x10^-5 M beta-mercaptoethanol 
(with the addition of 25 mM HEPES buffer for the step of polypeptide detachment and cell washing) 
plus either 2% or 10% Fetal Bovine Serum. All tests were performed in 96 well, V-bottom microtitre 
plates (PS micro plate). β-2 microglobulin was purchased from Sigma. The buffer used for polypeptide 
detachment was: 

13.76 g citric acid (MW 210.14 g/mol) 
5.43 g Na2HPO4.2H2O (MW 177.9 g/mol) 
in 500 ml distilled H2O 

pH initially adjusted to pH 4.0, but pH then re-adjusted depending upon which HLA type is being tested 
(see Table I). 

Dulbecco's Phosphate Buffered Saline (PBS) was purchased as a powder from Sigma. Reference 
polypeptides labelled with fluorescein at the cysteine residue (Table I) were prepared according to van 
der Burg et al., 1995, Hum. Immunol., 44, 189-198. Test polypeptides were purchased from NeoSystem, 
Strasbourg, or prepared according to standard methods. Positive control polypeptides were purchased 
from NeoSystem (Strasbourg, France). Details of the polypeptides used are described in Table I. 

Plastics, unless otherwise indicated, were purchased from Corning. 
METHODS: 

Cells were cultivated in Corning T175 flasks in 20 ml of culture medium (10% FBS). The night before the 
assay, cells were resuspended and 10 ml of fresh medium added. On the day of the test, cells were 
resuspended, counted, pelleted by centrifugation and resuspended in 5 ml complete medium with 10% 
FBS. Cells were distributed into a 6 well plate, 10^5 cells per well in 5 ml culture medium. Cells were then 
cultured for 4 hours at 37°C, 5% CO2. During that time polypeptides were prepared by dilution to 600 
µg/ml in PBS. Two-fold serial dilutions from 600 to 4.68 µg/ml (to give a final concentration of 100 to 0.78 µg/ml 
in the test plate) were first prepared in a separate 96 well 'polypeptide dilution' plate. 
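
The dilution scheme can be checked with a short Python sketch. The number of dilution points (eight) and the 1:6 factor between the dilution plate and the test plate are inferred here from the stated end points and from the volumes given in the Methods (25 µl of test polypeptide in a final well volume of 150 µl); the names are illustrative only.

```python
def two_fold_series(start, n_points):
    """Concentrations of a two-fold serial dilution, highest first."""
    return [start / (2 ** i) for i in range(n_points)]


DILUTION_PLATE_START_UG_ML = 600.0
N_POINTS = 8                 # assumed: 600 -> 4.6875 ug/ml takes eight points
TEST_VOLUME_UL = 25.0        # test polypeptide added per well
WELL_VOLUME_UL = 150.0       # 25 ul reference + 25 ul test + 100 ul cells

dilution_plate = two_fold_series(DILUTION_PLATE_START_UG_ML, N_POINTS)
test_plate = [c * TEST_VOLUME_UL / WELL_VOLUME_UL for c in dilution_plate]

for stock, final in zip(dilution_plate, test_plate):
    print(f"{stock:9.4f} ug/ml in dilution plate -> {final:8.4f} ug/ml in test plate")
```

Under these assumptions the lowest points are 4.6875 µg/ml in the dilution plate and 0.78125 µg/ml in the test plate, which agrees with the rounded values 4.68 and 0.78 given in the text.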

The test plate (96 well, V-bottom) was prepared as follows: 

- negative control (no polypeptide): 50 µl PBS 

- positive control (reference polypeptide only): 25 µl PBS + 25 µl fluorescent reference polypeptide 

- tests: 25 µl fluorescent reference polypeptide at 150 nM (final), then 25 µl of test polypeptides (including 
positive and negative control polypeptides) at their various dilutions were added. 

Plates were then placed in a refrigerator in the dark. 

After the 4 hour incubation of cells, the following were prepared, on ice: 

- Two 15 ml conical bottom test tubes containing culture medium with 2% FBS. 

- One 15 ml conical bottom test tube containing 10 ml culture medium with 2% FBS and 
including 1.5 µg/ml β-2 microglobulin. 

- One 15 ml conical bottom test tube containing 2 ml acid 'peptide detachment' buffer at the pH 
for the particular HLA type as indicated in Table I. 

Cells in the 5 ml culture medium in the 6 well plate were resuspended and transferred to a 15 ml conical 
bottom test tube and then centrifuged for 5 minutes at 1500 rpm (500g). The cells were resuspended in PBS and 
centrifuged a second time (500g). Supernatant was removed and 2 ml detachment buffer added while on ice. 
Cells were resuspended by gentle pipetting during the first 30 seconds of this 2 minute period. After 2 
minutes, 14 ml culture medium with 2% FBS was added. Cells were mixed by inverting the tube twice, 
then centrifuged at 2000 rpm (800g) for 3 minutes at 4°C. Supernatant was removed and cells 
resuspended in 14 ml cold culture medium with 2% FBS and centrifugation repeated (3 minutes at 
800g). Supernatant was removed and cells gently resuspended in 14 ml culture medium, 2% FBS and 
1.5 µg/ml β-2 microglobulin. One hundred µl of cells from this suspension were added to each well of the 
96 well plate, which already contained the polypeptides. The plate was wrapped in Saran Wrap and left 
24 hours at 4°C. The next day, plates were centrifuged at 1000 rpm (200g), supernatant removed and 
cells resuspended in 100 µl PBS containing 0.1% Bovine Serum Albumin (BSA) and 0.02% sodium 
azide, and cells pelleted by centrifugation at 200g. This step was repeated once more, then cells were 
resuspended in 1% paraformaldehyde and analysed for fluorescence by a FACScan (Becton Dickinson, 
Mountain View, California). 



The mean fluorescence intensity (MFI) of cells with the fluorescent reference polypeptide but with no 
competitor polypeptide (positive control) was taken as 0% inhibition. Similarly, the MFI of cells without 
the fluorescent reference polypeptide (negative control) was taken to be equal to 100% inhibition. 
Percentage inhibition was calculated as: 

\[
\%\ \text{inhibition} = \left(1 - \frac{(\text{MFI with test polypeptide}) - (\text{MFI negative control})}{(\text{MFI positive control}) - (\text{MFI negative control})}\right) \times 100
\]
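
As a worked illustration, the following Python sketch computes percentage inhibition with the positive control set to 0% and the negative control to 100%, as defined above, and estimates the concentration giving 50% inhibition by log-linear interpolation between neighbouring dilution points. The interpolation method, the function names and all MFI values are assumptions made for illustration; they are not taken from the experiments described here.

```python
import math


def percent_inhibition(mfi_test, mfi_positive, mfi_negative):
    """Positive control (reference only) = 0 %, negative control (no reference) = 100 %."""
    return (1 - (mfi_test - mfi_negative) / (mfi_positive - mfi_negative)) * 100


def ic50(concentrations, inhibitions):
    """Concentration giving 50 % inhibition, by log-linear interpolation.

    Returns None when 50 % inhibition is not reached within the tested range.
    """
    points = sorted(zip(concentrations, inhibitions))
    if points and points[0][1] >= 50:
        return points[0][0]
    for (c_low, i_low), (c_high, i_high) in zip(points, points[1:]):
        if i_low < 50 <= i_high:
            fraction = (50 - i_low) / (i_high - i_low)
            log_c = math.log10(c_low) + fraction * (math.log10(c_high) - math.log10(c_low))
            return 10 ** log_c
    return None


def is_positive(concentrations_ug_ml, inhibitions, threshold_ug_ml=20.0):
    """Apply the 'positive' criterion: 50 % inhibition at 20 ug/ml or less."""
    value = ic50(concentrations_ug_ml, inhibitions)
    return value is not None and value <= threshold_ug_ml


# Hypothetical MFI readings for three concentrations of one test polypeptide.
mfi_positive, mfi_negative = 1000.0, 10.0
concentrations = [0.78, 6.25, 50.0]
readings = [900.0, 400.0, 60.0]
inhibitions = [percent_inhibition(m, mfi_positive, mfi_negative) for m in readings]
print(is_positive(concentrations, inhibitions))
```

Passing a different `threshold_ug_ml` allows the same function to be used with a stricter cut-off.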



High affinity binding was taken to be 50% inhibition at < 10 µM polypeptide (~10 µg/ml). 

RESULTS: 

The competition for binding to selected HLA types between serial dilutions of selected peptides and the 
reference peptides (as described in Table I) is shown in Figures 1-7. Binding of polypeptides from the 
sequence of human MUC-1 to HLA-A2, B7, A3, A11, A24, A1 and B8 is shown in Figures 
1, 2, 3, 4, 5, 6 and 7 respectively. High affinity binding sequences were often, but not always, within the 
top 20 binding polypeptide sequences predicted by the BIMAS HLA Peptide Motif 
program (as described above). 
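
For orientation, the correspondence between HLA type and candidate sequences can be written as a small lookup. The sketch below is an illustrative Python helper, not part of the described method: the SEQ ID NO ranges follow the groupings recited in the claims, while the dictionary and function names are hypothetical.

```python
# SEQ ID NO ranges per HLA type, as grouped in the claims (upper bounds inclusive there,
# so Python ranges end one higher). SEQ ID NO: 19 appears in both the A3 and A11 groups.
HLA_TO_SEQ_IDS = {
    "A2": range(3, 7),
    "B7": range(7, 16),
    "A3": range(16, 20),
    "A11": range(19, 22),
    "A24": range(22, 26),
    "A1": range(26, 30),
    "B8": range(30, 34),
}


def candidate_seq_ids(hla_types):
    """Return the sorted SEQ ID NOs associated with any of the given HLA types."""
    ids = set()
    for hla in hla_types:
        # HLA types without a listed group contribute nothing.
        ids.update(HLA_TO_SEQ_IDS.get(hla, ()))
    return sorted(ids)


# A donor typed HLA-A2 and B7:
print(candidate_seq_ids(["A2", "B7"]))
```

For a donor typed HLA-A2 and B7 this returns SEQ ID NOs 3 to 15.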






2) ELISpot: 

The ELISpot is a technique which allows the identification of antigen-specific (in this case, MUC-1- 
specific) T cell recognition by the detection of antigen-induced production of cytokines (IFNγ, TNFα, 
IL-4, etc.) following an antigenic stimulation in vitro. More particularly, ELISpot allows the 
determination of the number of antigen-specific T lymphocytes in a population of peripheral blood 
mononuclear cells (PBMC) (Scheibenbogen et al., 1997, Int. J. Cancer 71-1). In this case, the production 
of IFNγ by CD8+ T cells (CTL) in response to polypeptides presented by autologous HLA 
molecules was examined. 

Briefly, in an ELISpot, the cytokines are captured between two specific antibodies. The first 
antibody, specific for human IFNγ, is adsorbed on a nitrocellulose membrane. Lymphocytes from human 
blood samples are added to the microtitre wells containing the attached antibody. Antigen, in the form 
of polypeptides, is also added to the wells. The principle is that polypeptides will attach to cell surface 
HLA molecules (together with β-2 microglobulin). Polypeptide-specific T cells will recognize the 
complex of polypeptide:HLA:β-2 microglobulin. Upon recognition of antigen, the T cells become 
'activated' to produce cytokines such as IFNγ. Secreted IFNγ is then captured by the antibody which 
is attached to the nitrocellulose. Cells are washed away, leaving behind the areas of secreted IFNγ. These 
areas are revealed by the second antibody (coupled to biotin) and then by a streptavidin-alkaline 
phosphatase conjugate. Hydrolysis of the substrate by the enzyme produces a visible spot. 
Thus each spot represents the 'fingerprint' of a cytokine-producing cell. The tests described below were 
performed using a commercially available kit (MABTECH, Nacka, Sweden). 

MATERIALS 

Peripheral blood mononuclear cells: 

In Figures 8 and 9, patient PBMC (Peripheral Blood Mononuclear Cells) were obtained from breast 
cancer patients who had participated in a Phase I clinical trial carried out at the Institut Curie, Paris. In 
Figure 10, patient PBMC came from prostate cancer patients who had participated in a similar 
immunotherapy Phase I trial in Los Angeles, USA. In these trials, patients were immunized with a 
Vaccinia virus construct which expresses, upon infection, both MUC-1 and IL-2. The goal was to 
generate an immune response to MUC-1, which is an antigen over-expressed on both types of cancer. 
PBMC were isolated from peripheral blood by Ficoll-Hypaque density centrifugation and the resulting 
mononuclear cells frozen in aliquots of 2 to 4x10 s cells in a 1 ml volume of culture medium containing 
10% DMSO and stored in the vapour phase of liquid nitrogen until use. 

TABLE II 

Patient Identification Code    Cancer             HLA-Type 
Pt#4SOM                        Advanced Breast    HLA-A 01/02; B 44/08 
Pt#5LEC                        Advanced Breast    HLA-A 02/24; B 07/44 
A002                           Prostate           HLA-A 01/26; B 38/08 


Tissue typing: 

Breast cancer patient PBMC were HLA typed by serology and PCR at the Etablissement de 
Transfusion Sanguine, Strasbourg. Prostate cancer patients' PBMC were HLA typed by PCR at 
Transgene using the 'One Lambda' HLA typing kit (One Lambda, Canoga Park, CA, USA). 

Polypeptides: 

Polypeptides were produced at NeoSystem (Strasbourg, France). 

ELISPOT: 

The ELISPOT kit was purchased from MABTECH (Nacka, Sweden) and used according to the 
manufacturer's instructions. 



Briefly, PBMC were cultured in 96 well microtitre plates for 48 hours in the presence of test 
or control polypeptides at 5 µg/ml and recombinant IL-2 at 30 units/ml. IFNγ spots were revealed with 
a second antibody reagent, also specific for human interferon gamma, according to the manufacturer's 
instructions. 

RESULTS: 

The results from three experiments are shown in Figures 8, 9 and 10. Patient PBMC were taken 
from liquid nitrogen and thawed the day before the ELISpot assay. The controls and the polypeptides 
(numbered according to their SEQ ID NO) were added as described above. Duplicate or triplicate wells 
containing 1- 2 x 10 s PBMC were used. The number of spots was determined and is represented as the 
number of spots per 1 0 s cells. 

The data in Figure 8 show that PBMC from patient #4 (who is HLA-A2) are able to 
respond to the polypeptide of SEQ ID NO: 4, in that PBMC from this patient are stimulated to produce 
IFNγ in the presence of this polypeptide but not in the presence of the negative control 
polypeptide or the polypeptide of SEQ ID NO: 3. The response is seen after vaccination (black histograms), but 
not before (white histograms). Figure 9 shows the results of an experiment in which PBMC from 
patient #5 (HLA-A2 and B7) are stimulated to produce IFNγ ELISpots upon exposure to polypeptide 
4 (SEQ ID NO: 4) and polypeptide 10 (SEQ ID NO: 10) but not to the negative control or to polypeptides 
3 or 7. No PBMC from prior to vaccination were available, but the patient's T cell response to a longer 
(24 aa) MUC-1 polypeptide, as determined by an in vitro CD4+ T cell proliferation assay, was 
discernible only in the weeks following vaccination and was undetectable 5 months afterwards. The 
transient nature of the T cell response is confirmed in Figure 9, in that only PBMC taken 28 days after 
vaccination (black histograms) were able to produce ELISpots over background, whereas PBMC taken 
5 months after injection produced no ELISpot response to these peptides (white histograms). 
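
The normalisation of spot counts and the comparison with background can be sketched in Python as follows. This is an illustration only: the cell numbers, the spot counts and the two-fold-over-background criterion are hypothetical and are not taken from the experiments reported here, which state only that responses were judged relative to background.

```python
from statistics import mean


def spots_per_cells(spot_counts, cells_per_well, per_cells):
    """Mean spot count of replicate wells, scaled to a chosen number of cells."""
    return mean(spot_counts) * per_cells / cells_per_well


def is_over_background(test_counts, background_counts, min_ratio=2.0):
    """Hypothetical criterion: mean test count at least min_ratio times background.

    Background is floored at one spot so that an empty control well
    does not make every test well count as a response.
    """
    return mean(test_counts) >= min_ratio * max(mean(background_counts), 1)


# Hypothetical duplicate wells; cell numbers are placeholders.
test_wells = [10, 14]
control_wells = [1, 2]
print(spots_per_cells(test_wells, cells_per_well=200_000, per_cells=1_000_000))
print(is_over_background(test_wells, control_wells))
```

Keeping the scaling factor as a parameter lets the same helper report spots per any chosen number of cells.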

These examples demonstrate the value of the invention in the diagnosis of a CD8+ T cell immune 
response to MUC-1. 

The invention could also be used in other diagnostic applications such as tetramer analysis, in 
which soluble MHC-I, beta-2-microglobulin and polypeptides of this invention are complexed together 
with a fluorescent reagent. The complex is then used to fluorescently tag T cells with an antigen receptor 
specific for that polypeptide. The quantification of the specific T cells is accomplished with a fluorescence- 
activated flow cytometer and can be done by one practised in the art. 

CLAIMS 

1. A polypeptide consisting of or comprising an amino acid sequence of at most 20 consecutive 
amino acids defined in SEQ ID NO: 1, said polypeptide binding at least one MHC-I glycoprotein, 
with the proviso that said polypeptide is different from SEQ ID NO: 2. 

2. The polypeptide according to claim 1, wherein the amino acid sequence is SEQ ID NO: 3 to SEQ 
ID NO: 33. 

3. The polypeptide according to claim 1 or 2, wherein the amino acid sequence is selected from the 
group consisting of: 

(a) SEQ ID NO: 3 to SEQ ID NO: 6, and said polypeptide binds the HLA A2 glycoproteins 
of MHC-I; 

(b) SEQ ID NO: 7 to SEQ ID NO: 15, and said polypeptide binds the HLA B7 glycoproteins 
of MHC-I; 

(c) SEQ ID NO: 16 to SEQ ID NO: 19, and said polypeptide binds the HLA A3 
glycoprotein of MHC-I; 

(d) SEQ ID NO: 19 to SEQ ID NO: 21, and said polypeptide binds the HLA A11 
glycoproteins of MHC-I; 

(e) SEQ ID NO: 22 to SEQ ID NO: 25, and said polypeptide binds the HLA A24 
glycoproteins of MHC-I; 

(f) SEQ ID NO: 26 to SEQ ID NO: 29, and said polypeptide binds the HLA A1 glycoproteins 
of MHC-I; and 

(g) SEQ ID NO: 30 to SEQ ID NO: 33, and said polypeptide binds the HLA B8 glycoproteins 
of MHC-I. 

4. A nucleotide sequence encoding a polypeptide according to any one of claims 1 to 3. 

5. The nucleotide sequence according to claim 4, wherein said nucleotide sequence is selected from 
the group consisting of SEQ ID NO: 34 to SEQ ID NO: 64, and their complementary sequences. 

6. The nucleotide sequence according to claim 4 or 5, wherein said nucleotide sequence contains 
elements allowing its expression in a host cell. 

7. The nucleotide sequence according to claim 6, wherein said element for expression in a host cell 
is a promoter. 

8. The nucleotide sequence according to any one of claims 4 to 7, wherein said nucleotide sequence 
is associated with one or more compounds selected from transfecting agents, stabilizing agents, 
or targeting agents. 

9. Vector comprising at least one nucleotide sequence according to claims 10 to 15. 

10. The vector according to claim 9 comprising at least two different nucleotide sequences encoding 
at least two polypeptides as defined in claim 3. 

11. The vector according to claim 9 or 10, wherein said vector is a plasmid. 

12. The vector according to claim 9 or 10, wherein said vector is a viral vector. 

13. A host cell comprising at least a nucleotide sequence according to any one of claims 4 to 8 or 
at least a vector according to any one of claims 9 to 12. 

14. The host cell according to claim 13, wherein said cell is a prokaryotic cell, yeast cell, or animal 
cell such as a mammalian cell. 

15. A composition comprising at least a nucleotide sequence according to any one of claims 4 to 8, 
a vector according to any one of claims 9 to 12, or a host cell according to claim 13 or 14. 

16. The composition according to claim 15, further comprising a pharmaceutical carrier. 

17. Use of a nucleotide sequence according to any one of claims 4 to 8, a vector according to any 
one of claims 9 to 12, a host cell according to claim 13 or 14 or a composition according to claim 
15 or 16 for the preparation of a medicament for effecting a CTL response in a subject. 

18. A composition comprising at least one polypeptide according to any one of claims 1 to 3. 

19. Use of at least one polypeptide according to any one of claims 1 to 3 for the preparation of a 
diagnostic or pharmaceutical composition. 

ABSTRACT 



Described are peptides and polypeptides derived from the MUC-1 polypeptide which are able to 
activate a Cytotoxic T Lymphocyte (CTL) response, nucleotide sequences encoding such peptides and 
polypeptides, and therapeutic uses thereof. Moreover, indications for selecting appropriate 
antigenic MUC-1 polypeptides with reference to the HLA type of the patient to be treated or tested 
are described. 

SEQUENCE LISTING 

<110> TRANSGENE SA 

<120> MUC-1 derived peptides 

<130> D 2195 BP 

<140> 
<141> 

<160> 64 

<170> PatentIn Ver. 2.1 

<210> 1 

<211> 1572 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (58)..(1542) 

<400> 1 

gaattccctg gctgcttgaa tctgttctgc cccctcccca cccatttcac caccacc 57 

atg aca ccg ggc acc cag tct cct ttc ttc ctg ctg ctg ctc ctg aca 105 
Met Thr Pro Gly Thr Gln Ser Pro Phe Phe Leu Leu Leu Leu Leu Thr 
1 5 10 15 

gtg ctt aca gtt gtt aca ggt tct ggt cat gca agc tct acc cca ggt 153 
Val Leu Thr Val Val Thr Gly Ser Gly His Ala Ser Ser Thr Pro Gly 
20 25 30 

gga gaa aag gag act tcg gct acc cag aga agt tca gtg ccc agc tct 201 
Gly Glu Lys Glu Thr Ser Ala Thr Gln Arg Ser Ser Val Pro Ser Ser 
35 40 45 

act gag aag aat gct gtg agt atg acc agc agc gta ctc tcc agc cac 249 
Thr Glu Lys Asn Ala Val Ser Met Thr Ser Ser Val Leu Ser Ser His 
50 55 60 

agc ccc ggt tca ggc tcc tcc acc act cag gga cag gat gtc act ctg 297 
Ser Pro Gly Ser Gly Ser Ser Thr Thr Gln Gly Gln Asp Val Thr Leu 
65 70 75 80 

gcc ccg gcc acg gaa cca gct tca ggt tca gcc gcc acc tgg gga cag 345 
Ala Pro Ala Thr Glu Pro Ala Ser Gly Ser Ala Ala Thr Trp Gly Gln 
85 90 95 

gat gtc acc tcg gtc cca gtc acc agg cca gcc ctg ggc tcc acc acc 393 
Asp Val Thr Ser Val Pro Val Thr Arg Pro Ala Leu Gly Ser Thr Thr 
100 105 110 

ccg cca gcc cac gat gtc acc tca gcc ccg gac aac aag cca gcc ccg 441 
Pro Pro Ala His Asp Val Thr Ser Ala Pro Asp Asn Lys Pro Ala Pro 
115 120 125 

ggc tcc acc gcc ccc ccg gcc cac ggt gtc acc tcg gcc ccg gac acc 489 
Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr 
130 135 140 

agg ccg ccc ccg ggc tcc acc gcc ccc gcc gcc cac ggt gtc acc tcg 537 
Arg Pro Pro Pro Gly Ser Thr Ala Pro Ala Ala His Gly Val Thr Ser 
145 150 155 160 

gcc ccg gac acc agg ccg gcc ccg ggc tcc acc gcc ccc ccg gcc cac 585 
Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His 
165 170 175 

ggt gtc acc tcg gcc ccg gac aac agg ccg gcc ttg ggc tcc acc gcc 633 
Gly Val Thr Ser Ala Pro Asp Asn Arg Pro Ala Leu Gly Ser Thr Ala 
180 185 190 

cct cca gtc cac aat gtc acc tcg gcc tca ggc tct gca tca ggc tca 681 
Pro Pro Val His Asn Val Thr Ser Ala Ser Gly Ser Ala Ser Gly Ser 
195 200 205 

gcc tct act ctg gtg cac aac ggc acc tct gcc agg gct acc aca acc 729 
Ala Ser Thr Leu Val His Asn Gly Thr Ser Ala Arg Ala Thr Thr Thr 
210 215 220 

cca gcc agc aag agc act cca ccc agc att ccc agc cac cac tct gat 777 
Pro Ala Ser Lys Ser Thr Pro Pro Ser Ile Pro Ser His His Ser Asp 
225 230 235 240 

act cct acc acc ctt gcc agc cat agc acc aag act gat gcc agt agc 825 
Thr Pro Thr Thr Leu Ala Ser His Ser Thr Lys Thr Asp Ala Ser Ser 
245 250 255 

act cac cat agc acg gta cct cct ctc acc tcc tcc aat cac agc act 873 
Thr His His Ser Thr Val Pro Pro Leu Thr Ser Ser Asn His Ser Thr 
260 265 270 

tct ccc cag ttg tct act ggg gtc tct ttc ttt ttc ctg tct ttt cac 921 
Ser Pro Gln Leu Ser Thr Gly Val Ser Phe Phe Phe Leu Ser Phe His 
275 280 285 

att tca aac ctc cag ttt aat tcc tct ctg gaa gat ccc agc acc gac 969 
Ile Ser Asn Leu Gln Phe Asn Ser Ser Leu Glu Asp Pro Ser Thr Asp 
290 295 300 

tac tac caa gag ctg cag aga gac att tct gaa atg ttt ttg cag att 1017 
Tyr Tyr Gln Glu Leu Gln Arg Asp Ile Ser Glu Met Phe Leu Gln Ile 
305 310 315 320 

tat aaa caa ggg ggt ttt ctg ggc ctc tcc aat att aag ttc agg cca 1065 
Tyr Lys Gln Gly Gly Phe Leu Gly Leu Ser Asn Ile Lys Phe Arg Pro 
325 330 335 

gga tct gtg gtg gta caa ttg act ctg gcc ttc cga gaa ggt acc atc 1113 
Gly Ser Val Val Val Gln Leu Thr Leu Ala Phe Arg Glu Gly Thr Ile 
340 345 350 

aat gtc cac gac gtg gag aca cag ttc aat cag tat aaa acg gaa gca 1161 
Asn Val His Asp Val Glu Thr Gln Phe Asn Gln Tyr Lys Thr Glu Ala 
355 360 365 

gcc tct cga tat aac ctg acg atc tca gac gtc agc gtg agt cat gtg 1209 
Ala Ser Arg Tyr Asn Leu Thr Ile Ser Asp Val Ser Val Ser His Val 
370 375 380 

cca ttt cct ttc tct gcc cag tct ggg gct ggg gtg cca ggc tgg ggc 1257 
Pro Phe Pro Phe Ser Ala Gln Ser Gly Ala Gly Val Pro Gly Trp Gly 
385 390 395 400 






atc gcg ctg ctg gtg ctg gtc tgt gtt ctg gtt gcg ctg gcc att gtc 1305 
Ile Ala Leu Leu Val Leu Val Cys Val Leu Val Ala Leu Ala Ile Val 
405 410 415 

tat ctc att gcc ttg gct gtc tgt cag tgc cga cga aag aac tac ggg 1353 
Tyr Leu Ile Ala Leu Ala Val Cys Gln Cys Arg Arg Lys Asn Tyr Gly 
420 425 430 

cag ctg gac atc ttt cca gcc cgg gat acc tac cat cct atg agc gag 1401 
Gln Leu Asp Ile Phe Pro Ala Arg Asp Thr Tyr His Pro Met Ser Glu 
435 440 445 

tac ccc acc tac cac acc cat ggg cgc tat gtg ccc cct agc agt acc 1449 
Tyr Pro Thr Tyr His Thr His Gly Arg Tyr Val Pro Pro Ser Ser Thr 
450 455 460 

gat cgt agc ccc tat gag aag gtt tct gca ggt aat ggt ggc agc agc 1497 
Asp Arg Ser Pro Tyr Glu Lys Val Ser Ala Gly Asn Gly Gly Ser Ser 
465 470 475 480 

ctc tct tac aca aac cca gca gtg gca gcc act tct gcc aac ttg 1542 
Leu Ser Tyr Thr Asn Pro Ala Val Ala Ala Thr Ser Ala Asn Leu 
485 490 495 

taggggcacg tcgccctctg agctgagtgg 1572 



<210> 2 

<211> 495 

<212> PRT 

<213> Homo sapiens 

<400> 2 

Met Thr Pro Gly Thr Gln Ser Pro Phe Phe Leu Leu Leu Leu Leu Thr 
1 5 10 15 

Val Leu Thr Val Val Thr Gly Ser Gly His Ala Ser Ser Thr Pro Gly 
20 25 30 

Gly Glu Lys Glu Thr Ser Ala Thr Gln Arg Ser Ser Val Pro Ser Ser 
35 40 45 

Thr Glu Lys Asn Ala Val Ser Met Thr Ser Ser Val Leu Ser Ser His 
50 55 60 

Ser Pro Gly Ser Gly Ser Ser Thr Thr Gln Gly Gln Asp Val Thr Leu 
65 70 75 80 

Ala Pro Ala Thr Glu Pro Ala Ser Gly Ser Ala Ala Thr Trp Gly Gln 
85 90 95 

Asp Val Thr Ser Val Pro Val Thr Arg Pro Ala Leu Gly Ser Thr Thr 
100 105 110 

Pro Pro Ala His Asp Val Thr Ser Ala Pro Asp Asn Lys Pro Ala Pro 
115 120 125 

Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr 
130 135 140 

Arg Pro Pro Pro Gly Ser Thr Ala Pro Ala Ala His Gly Val Thr Ser 
145 150 155 160 

Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His 
165 170 175 

Gly Val Thr Ser Ala Pro Asp Asn Arg Pro Ala Leu Gly Ser Thr Ala 
180 185 190 

Pro Pro Val His Asn Val Thr Ser Ala Ser Gly Ser Ala Ser Gly Ser 
195 200 205 

Ala Ser Thr Leu Val His Asn Gly Thr Ser Ala Arg Ala Thr Thr Thr 
210 215 220 

Pro Ala Ser Lys Ser Thr Pro Pro Ser Ile Pro Ser His His Ser Asp 
225 230 235 240 

Thr Pro Thr Thr Leu Ala Ser His Ser Thr Lys Thr Asp Ala Ser Ser 
245 250 255 

Thr His His Ser Thr Val Pro Pro Leu Thr Ser Ser Asn His Ser Thr 
260 265 270 

Ser Pro Gln Leu Ser Thr Gly Val Ser Phe Phe Phe Leu Ser Phe His 
275 280 285 

Ile Ser Asn Leu Gln Phe Asn Ser Ser Leu Glu Asp Pro Ser Thr Asp 
290 295 300 

Tyr Tyr Gln Glu Leu Gln Arg Asp Ile Ser Glu Met Phe Leu Gln Ile 
305 310 315 320 

Tyr Lys Gln Gly Gly Phe Leu Gly Leu Ser Asn Ile Lys Phe Arg Pro 
325 330 335 

Gly Ser Val Val Val Gln Leu Thr Leu Ala Phe Arg Glu Gly Thr Ile 
340 345 350 

Asn Val His Asp Val Glu Thr Gln Phe Asn Gln Tyr Lys Thr Glu Ala 
355 360 365 

Ala Ser Arg Tyr Asn Leu Thr Ile Ser Asp Val Ser Val Ser His Val 
370 375 380 

Pro Phe Pro Phe Ser Ala Gln Ser Gly Ala Gly Val Pro Gly Trp Gly 
385 390 395 400 

Ile Ala Leu Leu Val Leu Val Cys Val Leu Val Ala Leu Ala Ile Val 
405 410 415 

Tyr Leu Ile Ala Leu Ala Val Cys Gln Cys Arg Arg Lys Asn Tyr Gly 
420 425 430 

Gln Leu Asp Ile Phe Pro Ala Arg Asp Thr Tyr His Pro Met Ser Glu 
435 440 445 

Tyr Pro Thr Tyr His Thr His Gly Arg Tyr Val Pro Pro Ser Ser Thr 
450 455 460 

Asp Arg Ser Pro Tyr Glu Lys Val Ser Ala Gly Asn Gly Gly Ser Ser 
465 470 475 480 

Leu Ser Tyr Thr Asn Pro Ala Val Ala Ala Thr Ser Ala Asn Leu 
485 490 495 

<210> 3 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (58) . . (1542) 

<400> 3 

Ala Leu Gly Ser Thr Ala Pro Pro Val 
1 5 



<210> 4 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<400> 4 

Phe Leu Ser Phe His Ile Ser Asn Leu 
1 5 



<210> 5 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<400> 5 

Thr Leu Ala Pro Ala Thr Glu Pro Ala 
1 5 



<210> 6 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<400> 6 

Ser Leu Ser Tyr Thr Asn Pro Ala Val 
1 5 



<210> 7 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<400> 7 

Ser Val Pro Val Thr Arg Pro Ala Leu 
1 5 



<210> 8 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<400> 8 

Gly Val Pro Gly Trp Gly Ile Ala Leu 
1 5 



<210> 9 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<400> 9 

Ala Pro Gly Ser Thr Ala Pro Pro Ala 
1 5 



<210> 10 
<211> 9 
<212> PRT 

<213> Homo sapiens 
<400> 10 

Ala Val Ser Met Thr Ser Ser Val Leu 
1 5 



<210> 11 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<400> 11 

Ala Val Ala Ala Thr Ser Ala Asn Leu 
1 5 



<210> 12 
<211> 9 
<212> PRT 

<213> Homo sapiens 
<400> 12 

Arg Pro Gly Ser Val Val Val Gln Leu 
1 5 



<210> 13 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<400> 13 

Val Pro Gly Trp Gly Ile Ala Leu Leu 
1 5 

<210> 14 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<400> 14 

Ser Ala Ser Gly Ser Ala Ser Thr Leu 
1 5 



<210> 15 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<400> 15 

Ser Ala Gly Asn Gly Gly Ser Ser Leu 
1 5 



<210> 16 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<400> 16 

Ala Leu Ala Val Cys Gln Cys Arg Arg 
1 5 



<210> 17 
<211> 9 
<212> PRT 

<213> Homo sapiens 
<400> 17 

Gly Gln Leu Asp Ile Phe Pro Ala Arg 
1 5 



<210> 18 

<211> 10 

<212> PRT 

<213> Homo sapiens 

<400> 18 

Thr Leu Val His Asn Gly Thr Ser Ala Arg 
1 5 10 



<210> 19 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<400> 19 

Thr Thr Leu Ala Ser His Ser Thr Lys 
1 5 



<210> 20 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<400> 20 

Gly Val Thr Ser Ala Pro Asp Thr Arg 
1 5 



<210> 21 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<400> 21 

Gly Val Thr Ser Ala Pro Asp Asn Arg 
1 5 



<210> 22 

<211> 10 

<212> PRT 

<213> Homo sapiens 

<400> 22 

Ile Tyr Lys Gln Gly Gly Phe Leu Gly Leu 
1 5 10 



<210> 23 
<211> 10 
<212> PRT 

<213> Homo sapiens 
<400> 23 

Phe Phe Leu Ser Phe His Ile Ser Asn Leu 
1 5 10 



<210> 24 
<211> 10 
<212> PRT 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (58)..(1542) 

<400> 24 

Gly Phe Leu Gly Leu Ser Asn Ile Lys Phe 
1 5 10 



<210> 25 

<211> 10 

<212> PRT 

<213> Homo sapiens 

<400> 25 

Tyr Tyr Gln Glu Leu Gln Arg Asp Ile Ser 
1 5 10 



<210> 26 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<400> 26 

Ile Ser Glu Met Phe Leu Gln Ile Tyr 
1 5 



<210> 27 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<400> 27 

Asp Val Glu Thr Gln Phe Asn Gln Tyr 
1 5 



<210> 28 

<211> 10 

<212> PRT 

<213> Homo sapiens 

<400> 28 

Ser Leu Glu Asp Pro Ser Thr Asp Tyr Tyr 
1 5 10 



<210> 29 
<211> 10 
<212> PRT 

<213> Homo sapiens 
<400> 29 

Ile Ser Glu Met Phe Leu Gln Ile Tyr Lys 
1 5 10 



<210> 30 
<211> 9 
<212> PRT 

<213> Homo sapiens 
<400> 30 

Asn Ile Lys Phe Arg Pro Gly Ser Val 
1 5 



<210> 31 
<211> 9 
<212> PRT 

<213> Homo sapiens 
<400> 31 

Thr Leu Ala Phe Arg Glu Gly Thr Ile 
1 5 



<210> 32 

<211> 10 

<212> PRT 

<213> Homo sapiens 

<400> 32 

Gln Cys Arg Arg Lys Asn Tyr Gly Gln Leu 
1 5 10 



<210> 33 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<400> 33 

Glu Ala Ala Ser Arg Tyr Asn Leu Thr 
1 5 



<210> 34 

<211> 27 

<212> DNA 

<213> Homo sapiens 

<400> 34 

gccttgggct ccaccgcccc tccagtc 

<210> 35 

<211> 27 

<212> DNA 

<213> Homo sapiens 

<400> 35 

ttcctgtctt ttcacatttc aaacctc 



<210> 36 

<211> 27 

<212> DNA 

<213> Homo sapiens 

<400> 36 

actctggccc cggccacgga accagct 27 

<210> 37 

<211> 27 

<212> DNA 

<213> Homo sapiens 

<400> 37 

agcctctctt acacaaaccc agcagtg 



<210> 38 

<211> 27 

<212> DNA 

<213> Homo sapiens 

<400> 38 

tcggtcccag tcaccaggcc agccctg 



<210> 39 

<211> 27 

<212> DNA 

<213> Homo sapiens 

<400> 39 

ggggtgccag gctggggcat cgcgctg 

<210> 40 

<211> 27 

<212> DNA 

<213> Homo sapiens 

<400> 40 

gccccgggct ccaccgcccc cccggcc 



<210> 41 

<211> 27 

<212> DNA 

<213> Homo sapiens 

<400> 41 

gctgtgagta tgaccagcag cgtactc 

<210> 42 

<211> 27 

<212> DNA 

<213> Homo sapiens 

<400> 42 

gcagtggcag ccacttctgc caacttg 

<210> 43 

<211> 27 

<212> DNA 

<213> Homo sapiens 

<400> 43 

aggccaggat ctgtggtggt acaattg 

<210> 44 

<211> 27 

<212> DNA 

<213> Homo sapiens 

<400> 44 

gtgccaggct ggggcatcgc gctgctg 

<210> 45 

<211> 27 

<212> DNA 

<213> Homo sapiens 

<400> 45 

tctgcatcag gctcagcttc tactctg 

<210> 46 

<211> 27 

<212> DNA 

<213> Homo sapiens 

<400> 46 

tctgcaggta atggtggcag cagcctc 

<210> 47 

<211> 27 

<212> DNA 

<213> Homo sapiens 

<400> 47 

gccttggctg tctgtcagtg ccgccga 

<210> 48 

<211> 27 

<212> DNA 

<213> Homo sapiens 

<400> 48 

gggcagctgg acatctttcc agcccgg 

<210> 49 

<211> 30 

<212> DNA 

<213> Homo sapiens 

<400> 49 

actctggtgc acaacggcac ctctgccagg 

<210> 50 

<211> 27 

<212> DNA 

<213> Homo sapiens 

<400> 50 

accacacttg ccagccatag caccaag 

<210> 51 

<211> 27 

<212> DNA 

<213> Homo sapiens 

<400> 51 

ggtgtcacct cggccccgga caccagg 



<210> 52 

<211> 27 

<212> DNA 

<213> Homo sapiens 

<400> 52 

ggtgtcacct cggccccgga caacagg 

<210> 53 

<211> 30 

<212> DNA 

<213> Homo sapiens 

<400> 53 

atttataaac aagggggttt tctgggcctc 

<210> 54 

<211> 30 

<212> DNA 

<213> Homo sapiens 

<400> 54 

tttttcctgt cttttcacat ttcaaacctc 



<210> 55 

<211> 30 

<212> DNA 

<213> Homo sapiens 

<400> 55 

ggttttctgg gcctctccaa tattaagttc 



<210> 56 

<211> 30 

<212> DNA 

<213> Homo sapiens 

<400> 56 

tactaccaag agctgcagag agacatttct 



<210> 57 

<211> 27 

<212> DNA 

<213> Homo sapiens 

<400> 57 

atttctgaaa tgtttttgca gatttat 

<210> 58 

<211> 30 

<212> DNA 

<213> Homo sapiens 



<400> 58 

atttctgaaa tgtttttgca gatttataaa 



<210> 59 

<211> 30 

<212> DNA 

<213> Homo sapiens 

<400> 59 

tctctggaag atcccagcac cgactactac 

<210> 60 

<211> 27 

<212> DNA 

<213> Homo sapiens 

<400> 60 

gacgtggaga cacagttcaa tcagtat 



<210> 61 

<211> 27 

<212> DNA 

<213> Homo sapiens 

<400> 61 

aatattaagt tcaggccagg atctgtg 

<210> 62 

<211> 27 

<212> DNA 

<213> Homo sapiens 

<400> 62 

actctggcct tccgagaagg taccatc 

<210> 63 

<211> 30 

<212> DNA 

<213> Homo sapiens 

<400> 63 

cagtgccgcc gaaagaacta cgggcagctg 



<210> 64 

<211> 27 

<212> DNA 

<213> Homo sapiens 

<400> 64 

gaagcagcct ctcgatataa cctgacg 